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This  paper  categorically  rejects  the  possibility  of  considering  a  word- 
to-word  conversion  as  a  translation.  A  true  translation  is  unattainable, 
even  by  the  human  agent,  let  alone  by  mechanical  means.  However,  a 
crude  practical  translation  is  probably  achievable.  The  present  paper 
deals  with  a  scheme  for  the  syntactic  integration  of  Russian  sentences. 


INTRODUCTION 

From  the  moment  that  a  writer  conceives  an  idea 
which  he  desires  to  communicate  to  his  fellow  men, 
sizable  stumbling  blocks  are  strewn  in  the  path  of 
the  future  translator.  For  the  ability  to  shape  one’s 
thought  clearly,  or  even  completely,  is  not  granted  to 
many;  rarer  still  is  the  gift  of  expressing  the  thought — 
precisely,  concisely,  unambiguously — in  the  form  of 
words.  There  is  no  guarantee,  therefore,  that  the 
author’s  written  text  is  a  reliable  image  of  his  original 
idea. 

Furnished  with  this  more  or  less  distorted  record, 
the  translator  is  expected  to  perform  a  number  of 
amazing  feats.  In  the  first  place,  he  has  to  discern — 
often  through  the  dim  mist  of  the  source  language — 
the  writer’s  precise  intention.  This  requires  not  only  a 
perfect  knowledge  of  both  the  source  language  and  the 
subject  matter  treated  in  the  text,  but  also  the  mental 
skills  customarily  exercised  by  the  professional  sleuth. 
In  addition,  these  newly  reconstructed  ideas  must  be 
rendered  into  a  target  language  which  is  so  unequivo¬ 
cal  and  so  faithful  to  the  source — as  to  convey,  to  every 
reader  of  the  translator’s  product,  the  exact  meaning 
of  the  original  foreign  text! 

Small  wonder,  then,  that  a  fabulous  achievement 
like  Fitzgerald’s  translation  of  the  Rubaiyat  is  re¬ 
garded  in  the  nature  of  a  miracle.  For  the  general  case, 
it  would  seem  that  characterizing  a  sample  of  the 
translator’s  art  as  a  good  translation  is  akin  to  charac¬ 
terizing  a  case  of  mayhem  as  a  good  crime:  in  both 
instances  the  adjective  is  incongruous. 

If,  as  a  crowning  handicap,  we  are  asked  to  replace 
the  vast  capacity  of  the  human  brain  by  the  paltry 
contents  of  an  electronic  contraption,  the  absurdity  of 
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aiming  at  anything  higher  than  a  crude  practical  trans¬ 
lation  becomes  eminently  patent. 

Perhaps  we  are  belaboring  this  point;  we  do  so  to 
avoid  later  arguments  about  the  “quality”  of  our  work. 
If,  for  example,  a  translated  article  enables  a  scientist 
to  reproduce  an  experiment  described  in  a  source 
paper  and  to  obtain  the  same  results, — such  a  transla¬ 
tion  may  be  regarded  as  a  practical  one.  Perhaps  the 
translation  is  not  couched  in  elegant  terms;  here  and 
there  several  alternative  meanings  are  given  for  a  tar¬ 
get  word;  a  word  or  two  may  appear  as  a  mere  trans¬ 
literation  of  original  source  words.  Nevertheless,  this 
translation  has  served  its  main  purpose:  a  scholar  in 
one  land  can  follow  the  work  of  his  colleague  in  another. 

This  limited  scope  has  been  set  for  us  by  our  own 
as  well  as  the  machine’s  deficiencies.  The  heartbreak¬ 
ing  problem  which  we  face  in  mechanical  translation 
is  how  to  use  the  machine’s  considerable  speed  to 
overcome  its  lack  of  human  cognizance.  We  do  not  yet 
really  understand  how  the  human  mind  associates 
ideas  at  its  immense  rate  of  speed;  for  example,  how 
does  it  differentiate  seemingly  instantaneously  between 
the  two  meanings  of  calculus  in  the  following  sen¬ 
tences:  (1)  The  surgeon  removed  the  staghorn  calculus 

from  the  patient’s  kidney,  and  (2)  The  professor  an¬ 
nounced  a  new  course  in  advanced  calculus.  And  yet, 
a  scheme  for  discerning  such  differences  is  what  we 
must  impart  to  the  machine. 

Even  if  there  now  existed  a  completely  satisfactory 
method  for  machine  translation.  today’s  machines 
would  not  be  adequate  tools  for  its  implementation. 
They  lack  automatic  transformers  of  printed  text  into 

coded  signals,  and  their  external  storage  devices  are 
not  up  to  the  mark. 

Before  coming  to  grips  with  the  mechanical  trans¬ 

lation  problem,  we  investigated  the  types  of  difficulties 
we  might  encounter.  We  found  that  they  fall  into  ten 
groups;  so  far,  we  have  been  able  to  cope — more  or  less 
successfully — with  only  the  first  five,  which  depend 
mainly  on  syntactic  analysis.  Some  thought  has  been 
given  to  the  far  more  difficult  points  involving  seman¬ 
tic  considerations,  but  the  short  time  spent  in  this  area 
has  not  allowed  us  to  transform  the  mathematical 
“existence  solutions”  into  practical  machine  applica¬ 

tion.  Thus,  discussion  of  semantic  problems  is  deferred. 


33 


Report  Documentation  Page 


Form  Approved 
OMB  No.  0704-0188 


Public  reporting  burden  for  the  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and 
maintaining  the  data  needed,  and  completing  and  reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information, 
including  suggestions  for  reducing  this  burden,  to  Washington  Headquarters  Services,  Directorate  for  Information  Operations  and  Reports,  1215  Jefferson  Davis  Highway,  Suite  1204,  Arlington 
VA  22202-4302.  Respondents  should  be  aware  that  notwithstanding  any  other  provision  of  law,  no  person  shall  be  subject  to  a  penalty  for  failing  to  comply  with  a  collection  of  information  if  it 
does  not  display  a  currently  valid  OMB  control  number. 


1.  REPORT  DATE 

NOV  1961 

2.  REPORT  TYPE 

3.  DATES  COVERED 

00-00-1961  to  00-00-1961 

4.  TITLE  AND  SUBTITLE 

A  New  Approach  to  the  Mechanical  Syntactic  Analysis  of  Russian 

5a.  CONTRACT  NUMBER 

5b.  GRANT  NUMBER 

5c.  PROGRAM  ELEMENT  NUMBER 

6.  AUTHOR(S) 

5d.  PROJECT  NUMBER 

5e.  TASK  NUMBER 

5f.  WORK  UNIT  NUMBER 

7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS(ES) 

National  Bureau  of  Standards, Gaithersburg, MD, 20899 

8.  PERFORMING  ORGANIZATION 

REPORT  NUMBER 

9.  SPONSORING/MONITORING  AGENCY  NAME(S)  AND  ADDRESS(ES) 

10.  SPONSOR/MONITOR'S  ACRONYM(S) 

11.  SPONSOR/MONITOR'S  REPORT 
NUMBER(S) 

12.  DISTRIBUTION/AVAILABILITY  STATEMENT 

Approved  for  public  release;  distribution  unlimited 

13.  SUPPLEMENTARY  NOTES 

14.  ABSTRACT 

15.  SUBJECT  TERMS 

1 

1 

16.  SECURITY  CLASSIFICATION  OF: 


a.  REPORT 

unclassified 


b.  ABSTRACT 

unclassified 


c.  THIS  PAGE 

unclassified 


17.  LIMITATION  OF 
ABSTRACT 

Same  as 
Report  (SAR) 


18.  NUMBER 
OF  PAGES 

18 


19a.  NAME  OF 
RESPONSIBLE  PERSON 


Standard  Form  298  (Rev.  8-98) 

Prescribed  by  ANSI  Std  Z39-18 


In  this  paper  we  are  concerned  mainly  with  syntactic 
analysis. 

The  Glossary 

One  of  the  indispensable  accessories  of  MT  is  the 
construction  of  a  specialized  source-to-target  glossary. 
The  conventional  publications  would  not  suffice  for 
MT,  because  their  authors  presuppose,  on  the  part 
of  the  prospective  user,  (1)  a  wide  acquaintance  with 
the  basic  principles  of  the  source  language,  (2)  an 
excellent  knowledge  of  the  target  language,  and  (3)  a 
considerable  familiarity  with  the  terminologies — in 
both  languages — relating  to  the  special  subject  of  the 
source  text.  These  assumptions  are  hardly  justified  even 
in  the  case  of  the  professional  translator.  It  follows  that 
a  glossary,  designed  for  use  with  an  electronic  proces¬ 
sor,  must  embody  an  immense  amount  of  information 
in  addition  to  the  material  culled  from  the  best  exist¬ 
ing  dictionaries.  But  there  is  a  limit  to  the  amount  of 
data  that  can  be  handled  by  even  the  most  advanced 

type  of  electronic  processor,  if  MT  is  to  be  at  all 
expedient.  It  is  imperative,  therefore,  that  utmost  care 
be  used  to  select  (1)  the  absolutely  minimum  quantity 
of  information  which  would  suffice  for  our  needs,  (2) 
the  most  economical  (space  and  time-saving)  form  for 
representing  it,  and  (3)  the  most  suitable  external 
media  for  its  storage  and  retrieval. 

Of  far  greater  concern  is  the  fact  that  we  are  not 
fully  aware  of  the  mental  processes  involved  in  the 
performance  of  the  translation  task.  Yet  a  routine, 
paralleling  these  processes,  must  be  prepared  for  in¬ 
sertion  into  the  machine’s  memory.  Unfortunately,  the 
form  of  the  glossary  depends  upon,  and  varies  with, 
the  particular  translation  scheme  which  is  being  devel¬ 
oped.  We  would  not  venture  to  predict  the  date  when 

our  own  glossary  might  assume  its  final — or  even 

“passable” — shape.  We  are  constrained,  for  the  present, 
to  use  a  small  sample  glossary,  sufficient  for  trial  runs 
on  the  computer.  It  is  stored  in  the  external  memory 
and  is  arranged  in  groups,  each  of  which  lists  the 
Satellites  of  a  source  Pseudo-root.*  Each  satellite  is  an 
entry  corresponding  to  a  source  Stem  which  contains 
the  pseudo-root  in  question.  The  temporary  form, 
which  each  Glossary  Entry  has  assumed  so  far,  consists 
of  the  following  items: 

1.  The  Source  Transform,  which  is  a  greatly  con¬ 

tracted  form  of  the  original  source  stem. 

2.  Morphological  information,  designed  to  aid  in 
the  syntactical  analysis  of  each  sentence,  as  illustrated 
in  Section  B  of  Part  II. 

3.  Predictions  regarding  future  Occurrences.  For 
instance,  the  Russian  verb  with  stem  CJIYUC  is  marked 
as  frequently  followed  by  an  indirect  object  in  the 
dative  case  and/or  a  complement  in  the  instrumental; 
also  sometimes  by  a  verb  in  the  infinitive. 

4.  One  or  more  target  correspondents  (T)  to  the 
source  stem. 


The  List  of  Terms  and  List  of  Symbols  at  the  end  of  the  paper 
may  enable  the  reader  to  identify  unfamiliar  expressions.  Technical 
words  to  be  found  therein  are  capitalized  when  first  encountered  in 
the  text. 


(It  is  planned  to  expand  this  information  to  include 
diacritical  material  designed  to  aid  in  the  semantic 
analysis  of  the  sentence.) 

PARTI 

Our  program  is  being  coded  in  two  parts.  Of  these 

only  the  first,  which  consists  of  two  sections,  has  been 
completed  and  tested. 

Section  A. 

The  aim  of  this  section  is  to  investigate  the  nature  of 

each  Occurrence  in  a  sentence  and,  for  the  case  when 
the  occurrence  is  a  word,  to  perform  a  glossary  look-up. 
When  an  occurrence  in  a  given  Russian  text  is  read 
into  the  machine — and  we  have  reason  to  hope  that 

this  will  be  accomplished  eventually  by  a  fully  auto¬ 

matic  device — this  source  material  is  subjected  to  the 
following  treatment  within  the  computer. 

1.  An  Identification  Tag  (t)  is  appended  to  the 
occurrence  to  indicate  the  page,  sentence,  and  serial 
number.  Its  characters  are  counted  and  examined  for 
indications  anent  its  physical  make-up.  For  instance, 
the  machine  examines  whether  the  occurrence  is  a 
word,  or  perhaps,  a  punctuation  mark,  formula,  etc. 
If  a  word,  it  notes  whether  it  starts  with  a  capital  or 
is  an  initial,  whether  it  contains  any  indication  of 
foreign  origin.  This  orthographical  material  will  be 
augmented  and  revised  in  succeeding  steps  to  form 
General  Specifications  (GS).  It  is  recorded  in  the  in¬ 
ternal  memory  space  S,,  allotted  to  the  occurrence  t. 

2.  If  the  current  occurrence  is  not  a  word,  this  fact 
is  indicated  in  the  Profile  Skeleton  (PS)  which  will 
eventually  be  expanded  to  serve  as  a  rough  outline  of 
the  clause  formation  of  the  source  sentence  to  which 
the  occurrence  belongs.  If,  moreover,  the  occurrence 
is  identified  as  a  period,  a  subroutine  is  consulted  to 
determine  whether  this  punctuation  marks  the  end  of 
the  sentence.  If  such  be  the  case,  this  fact  is  indicated 
in  the  profile  skeleton,  and  the  sentence  number  is 
raised  for  storage  in  the  succeeding  tag  numbers,  t. 

3.  If  the  given  occurrence  is  a  word,  a  search  is 
made  in  a  Special  List  of  frequently  used  words.  If  the 
word  is  found  in  the  special  list,  the  diacritical  mate¬ 
rial  accompanying  it  may  show  that  it  could  be  the 
leading  word  of  one  or  more  idioms.  In  that  case,  the 
requisite  number  of  successive  source  occurrences  will 
be  compared  to  each  of  the  indicated  idioms,  and 
when  agreement  is  found,  the  entire  source  idiom  is 
replaced  by  the  corresponding  material  and  is  there¬ 
after  treated  as  a  single  occurrence. 

4.  If  the  word  is  not  found  in  the  above  list,  it  is 
decomposed  into  its  Pseudo-prefixes,  pseudo-root  (or 
roots).  Pseudo-suffixes,  and  Source  Ending  by  means 
of  corresponding  Lists  stored  in  the  internal  memory 
(the  pseudo-root  and  true  source  ending  are  deter¬ 
mined  by  a  rather  complicated  iterative  scheme.) 

The  ending  is  replaced  by  the  address  /?,  found 
alongside  its  listed  counterpart.  It  is  stored  in  S,  and 
will  be  used  in  Part  II. 
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Each  pseudo-prefix  and  pseudo-suffix  (if  any)  is 
replaced  by  a  single  character,  consisting  of  6  bits,  and 
the  combination  of  these  characters  (probably  no  more 
than  8)  constitutes  the  transform  (A)  of  the  original 
source  word;  y  and  z,  the  number  of  pseudo-prefixes 
and  pseudo-suffixes,  as  well  as  A,  are  stored  in  St. 

The  remaining  portion  of  the  current  word,  consti¬ 
tuting  the  pseudo-root,  may  have  no  characters  at  all. 
The  glossary  contains  a  group  of  satellites  for  a  null 
pseudo-root,  whose  Extended  Address,  do.  is  used  to 
represent  it  in  the  next  step. 

If  the  pseudo-root  contains  at  least  one  character, 
it  may  not  have  been  found  in  the  list  of  pseudo-roots. 
In  that  case,  the  transliteration  subroutine  dictates  the 
form  of  the  correspondent  to  be  stored  in  the  normal 
position  of  the  target  T  for  the  final  printout.  A  suitable 
Signal  of  Peculiarity  (8)  is  stored  in  GS.  The  Corre¬ 
spondence  Flag  (c)  in  GS  is  set  to  zero. 

If  the  pseudo-root  has  been  located  in  the  list,  its 
counterpart  is  accompanied  by  an  extended  address,  a, 
indicating  where  its  group  of  satellites  starts  in  the  ex¬ 
ternally  stored  glossary. 

5.  The  extended  address,  a,  accompanied  by  the 
identification  tag  t,  is  intersorted  with  similar  combina¬ 
tions.  corresponding  to  the  previously  processed  source 
words,  in  the  Sorting  File. 

6.  When  all  the  internal  space  allotted  for  the  sort¬ 
ing  file  is  filled,  a  search  is  made  throughout  the  entire 
glossary  for  the  indicated  entries.  Since  the  time  for 
such  a  transit  throughout  the  glossary  is  formidable, 
and  remains  practically  constant  irrespective  of  the 
number  of  words  to  be  looked  up,  it  is  obvious  that  an 
appreciable  increase  in  internal  storage  space  would 
result  in  a  corresponding  reduction  in  the  look-up  time 
per  word.  However,  considering  the  high  cost  of  in¬ 
ternal  storage  devices,  it  might  be  more  expedient  to 
utilize  inexpensive  non-erasable  external  storage  media 
with  suitable  buffering  devices  which  allow  for  the 
simultaneous  retrieval  of  information  along  several 
channels. 

7.  When  the  extended  address  a  attached  to  t  is 
reached  during  transit  of  the  glossary,  the  routine 
searches  for  the  entry  corresponding  to  the  y.  z.  A  of 
the  occurrence  t.  The  correspondence  flag  c  is  set  to  1 
or  0  in  GS,  according  to  whether  the  search  has  been 
successful  or  not.  In  the  latter  case,  the  pertinent 
peculiarity  signal  is  stored  in  GS  and  the  tag  t  is  placed 
in  the  normal  position  of  the  target  T  for  final  printout. 

ILLUSTRATION  1. 

As  an  example  of  the  performance  of  this  section  of  the 
program,  we  offer  the  text  word  PACIIOJJOKEHHE. 
Suppose  this  word  occurs  as  the  7th  word  of  the  4th 
sentence  on  page  1.  The  corresponding  symbol  for  t  is 
1.4.7.  The  occurrence  is  examined  and  found  to  be  a 
word  (not  a  punctuation  mark  etc.)  composed  of  12 
letters.  The  Word  Flag  (w)  in  GS  would  be  set  to  1. 
The  machine  determines  that  no  such  word  appears 
in  the  special  list  of  frequently  used  words.  The  oc¬ 


currence  is  therefore  examined  for  pseudo-prefixes.  In 
this  case,  the  combinations  PAC  and  770  happen  to  be 
true  prefixes.  By  referring  to  the  stored  list  of  pseudo¬ 
prefixes,  the  routine  would  replace  PAC  by  the  letter 
V  and  no  by  the  letter  R.  Unable  to  discover  more 
prefixes,  the  routine  would  isolate  the  ending  HE. 
Suppose  that  the  list  of  endings  indicates  that  infor¬ 
mation  on  this  ending  is  stored  in  internal  memory 

beginning  at  address  357;  the  machine  then  sets  /?  = 
357.  The  routine  would  proceed  to  identify  EH  as  a 
suffix  and  replace  it  by  the  letter  K.  Finding  no  more 
pseudo-suffixes,  the  routine  would  store  in  S  1,4,7  the 

numerals  2  and  1,  to  indicate  the  number  of  prefixes 

and  suffixes  y  and  z;  these  would  be  followed  by  the 

transform  A,  which  is  VRK.  The  machine  would  then 

enter  the  subroutine  for  identifying  the  pseudo-root. 
In  the  present  case,  no  difficulties  would  be  en¬ 

countered,  as  JIO)K  would  be  located  at  once  in  the 
list  of  pseudo-roots.  In  actual  practice,  a  number  of 
complications  may  arise.  The  given  word  may  contain 
a  polyroot;  or  what  we  assumed  to  be  an  ending  may 
actually  be  part  of  the  pseudo-root;  or  we  may  not  be 
able  to  locate  the  root  at  all.  The  sub-routine  takes 

note  of  all  these  possibilities. 

The  root  JIO)K  is  replaced  by  a  which  would  be, 

say,  2.47.3097,  if  the  first  member  in  the  group  of 

this  root’s  satellites  has  the  position  number  3097  in 
the  47th  block  on  the  2nd  tape.  To  a  we  attach  the 

tag  t  and  intersort  the  result  with  the  other  contents  of 
the  sorting  file.  The  entry  in  the  internal  memory,  cor¬ 
responding  to  the  occurrence  PACnOJIO)KEHHE, 

now  has  the  two  forms: 


Storage 

GS 

P 

y.z  A 

S 1,4,7 

Orthographic 

description 

357 

2.1  VRK 

a 

t 

Sorting 

File 

2.47.3097 

1.4.7 

After  a  specified  number  of  successive  occurrences 
have  been  analyzed  in  this  way,  a  transit  will  be  made 
through  the  glossary.  When  the  position  3097  of  the 
47th  block  on  the  2nd  tape  is  reached,  the  machine 
will  locate  and  extract  all  the  material  corresponding 
to  2.  1.  VRK,  i.e.  all  the  information  pertinent  to  the 
stem  PACnOJIO)KEH.  In  GS,  the  correspondence  flag 
c  would  be  set  to  1  to  indicate  that  the  search  had 
been  successful. 

Section  B. 

In  this  section  we  examine  each  word-occurrence  of  a 
sentence  with  two  aims  in  view: 

1.  To  assign  to  it  all  possible  grammatical  inter¬ 
pretations,  which  we  call  Temporary  Choices,  TC/. 
These  are  arranged  roughly  in  order  of  most  probable 
appearance;  /  indicates  the  serial  number.  Information 
common  to  all  TCj  is  labeled  with/=  0. 
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2.  To  indicate  its  significance  in  the  profile  skeleton. 
To  accomplish  the  first  aim  we  distinguish  three  types 
of  words: 

a.  If  a  source  word  is  found  in  the  special  list  of 

frequently  used  words,  its  various  TQ  are  ex¬ 
plicitly  listed  there. 

b.  For  a  word  whose  transform  is  found  in  the 
glossary,  the  TQ'  are  obtained  by  finding  the 
common  intersection  between  the  possibilities 
given  by  its  ending  in  the  Table  of  Endings  and 
those  given  by  the  morphological  information  of 
the  stem"  s  glossary  material. 

c.  When  a  source  word  is  represented  merely  by 

its  transliteration,  the  TQ  must  be  made  on  the 

basis  of  its  ending  (and,  possibly,  its  suffixes) 
only. 

As  regards  the  second  aim,  the  TQ  which  accompany 
a  current  word  may  reveal  that  it  could  be  a  possible 
indicator  of  a  main  clause,  or  subordinate  clause,  or  a 
phrase.  If  such  is  the  case,  an  appropriate  signal  is 
added  to  the  profile  skeleton,  in  which  the  nature  of 
the  non-word  occurrences  has  previously  been  stored. 
The  profile  skeleton  will  be  subjected  to  a  crude  analy¬ 
sis  in  Section  A  of  Part  II. 

ILLUSTRATION  2. 

Let  us  use  again  the  word  PACnOJIOPKEHPIE,  be¬ 
longing  under  the  heading  2b  above.  The  glossary’s 
morphological  information  indicates  that  its  stem, 
PACnOJIOyKEH.  could  represent  either 

1.  An  inanimate  neuter  noun,  belonging  to  a  de¬ 
clension  class  which  is  identified  by  the  ending  EIE  in 

the  nominative  singular;  or 

2.  An  adjective,  of  verbal  origin,  belonging  to  a 
declension  class  which  is  identified  by  the  ending  bill 
in  the  masculine  nominative  singular. 

This  material,  used  in  conjunction  with  the  infor¬ 
mation  listed  for  the  ending  PIE  leads  the  machine  to 
eliminate  the  second  possibility  given  by  the  glossary 
and  to  list  the  following  two  temporary  choices: 

TCo  Noun,  inanimate,  neuter  (common  to  both) 

TQ  nominative,  singular 

TCt  accusative,  singular 

This  word  does  not  call  for  the  insertion  of  a  signal 
into  the  profile  skeleton  (PS). 

PART  II 

Part  II  of  the  projected  scheme,  now  in  process  of  be¬ 
ing  programmed,  has  the  putpose  of  analyzing  the 
syntactical  structure  of  each  source  sentence  and  of 
constructing  a  corresponding  target  sentence.  While 
Part  I  works  on  at  least  several  hundred  source  words 
in  one  pass — the  number  of  such  words  is  determined 
by  the  internal  memory  capacity  of  the  machine — Part 
II,  which  is  made  up  of  three  sections,  works  on  one 
sentence  at  a  time. 

Section  A  determines,  as  far  as  possible  at  this  stage, 
the  clausal  and  phrasal  structure  within  the  sentence. 
Section  B  is  an  iteration  scheme  for  examining  syntac¬ 
tical  relations  among  the  Strings  of  a  sentence.  It  proc¬ 


esses  each  string  in  turn  from  the  beginning  to  the  end 
of  each  sentence,  repeats  this  process  if  necessary  and 
decides  whether  a  translation  has  been  effected.  There¬ 
after  Section  C  takes  over,  composes  a  target  sentence, 
and  prints  it  out. 

Types  of  Difficulties. 

We  shall  list,  in  order  of  increasing  complexity,  the 
ten  difficulties  which  obstruct  our  path  toward  such  a 
goal: 

1 .  The  stem  of  a  source  word  is  not  listed  in  our 
glossary.  This  will  occur  quite  often  in  our  translation 
scheme,  as  we  intend  to  omit  from  the  glossary  the 
majority  of  non-Slavic  stems. 

2.  The  target  sentence  requires  the  insertion  of  key 

English  words,  which  are  not  needed  for  grammatical 
completeness  of  the  source  sentence.  For  instance,  the 
complete  Russian  sentence:  OH  EEJJHblPI  (literally 
He  poor )  should  be  translated  as  He  (is)  (a)  poor 
( man ). 

3.  The  source  sentence  contains  well-known  idio¬ 
matic  expressions. 

4.  The  occurrences  of  a  source  sentence  do  not  ap¬ 

pear  in  the  conventional  order.  Sober  writing,  without 
color  or  emphasis,  employs  few  inversions.  Our  method, 
which  consists  of  predicting  each  occurrence  on  the 
basis  of  the  preceding  ones,  works  quite  well  in  that 
case.  But  such  orderliness  cannot  be  expected  to  hold 

for  long  stretches  of  the  text. 

5.  The  source  sentence  contains  more  than  one 
clause. 

6.  Corresponding  to  an  occurrence  in  the  source 
sentence,  more  than  one  target  word  is  listed  in  the 
glossary.  Polysemy  is,  of  course,  recognized  as  a  most 
formidable  obstacle  to  faithful  translation,  whether 
human  or  mechanical.  Hilarious  (or  heartbreaking,  de¬ 
pending  on  your  point  of  view)  “malaprops”  can  be 
cited  by  the  score  to  uphold  the  conviction  of  many 
linguists  that  the  MT  task  is  a  hopeless  one.  Our  faith 
in  the  inventiveness  of  the  human  brain  makes  us  re¬ 
ject  such  gloomy  forebodings. 

7.  The  source  sentence  is  grammatically  incom¬ 
plete.  Such  a  situation  is  frequently  the  result  of 

carrying  on  the  thought  from  one  or  more  previous 

sentences.  To  succeed,  any  MT  scheme  will  have  to 
be  able  to  transcend  the  boundaries  of  a  sentence  (or 
a  paragraph,  or  a  section). 

8.  The  source  sentence  contains  ambiguous  sym¬ 
bols.  Since  we  are  planning  to  confine  our  efforts  to 
mathematical  texts,  such  occurrences  will  be  legion. 

9.  The  syntactic  integration  of  the  source  sentence 

results  in  an  ambiguity.  It  is  often  of  a  type  that  could 
be  resolved  by  semantic  considerations;  but  sometimes, 

it  is  inherent  and  thus  not  removable  by  any  process. 

10.  A  combination  of  difficulties  is  listed  in  this 

category.  They  are  quite  annoying  but  fortunately  rare: 

misprints;  grammatical  errors;  localisms;  peculiar  nu¬ 
ances;  comments  based  upon  the  sound  (or  the  spell¬ 
ing)  of  source  occurrences,  such  as  puns  whose  sense 
it  is  impossible  to  render  into  the  target  language. 
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We  have  thus  grouped  Russian  sentences  into  210, 

i.e.  1024,  types.  A  sentence  possessing  none  of  the  ten 
difficulties  would  be  represented  by  type  number  00000 
000002  whereas — at  the  other  end — a  sentence  exhibit¬ 
ing  all  the  difficulties  would  belong  to  type  11111 

11111,=  102310. 

Our  scheme  is  able  to  cope  successfully — we  believe 
— with  the  first  five  types  of  difficulties,  which  involve 
only  monosemantic  occurrences,  or  at  most  idiomatic 
expressions.  We  can  thus  handle  32  types  of  sentences 
ranging  in  type  number  from  00000  000002  to  00000 
111112. 

Section  A. 

In  both  sections  of  Part  I  we  kept  up,  for  each  source 
sentence,  a  profile  skeleton  which  consists  of  a  set  of 
signals  denoting  to  which  special  class  (if  any)  each 
occurrence  belongs.  This  tentative  outline  serves  to  in¬ 
dicate  where  the  clauses  and  phrases  of  the  sentence 
might  have  their  inception.  The  routine  in  the  present 

section  carries  out  an  iterative  process  which  aims  to 
set  rough  limits  to  these  ranges,  based  upon  the  posi¬ 
tion  in  the  sentence  of  its  (1)  punctuation  marks,  (2) 

conjunctions,  (3)  actual,  or  possible,  starters  of  main 
clauses,  (4)  actual,  or  possible,  starters  of  subordinate 

clauses,  (5)  actual,  or  possible,  predicates  for  each 
clause,  and  (6)  actual,  or  possible,  phrase  starters. 

As  a  result  of  this  iterative  scheme,  the  profile  skele¬ 
ton  PS  is  replaced  by  a  Temporary  Profile  (TP),  in 
which  each  occurrence  is  associated  with  four  desig¬ 

nators: 

1 .  Its  clause  number  (C), 

2.  A  Status  Flag  (v)  to  indicate  whether  the  predi¬ 
cate  of  the  clause  has  or  has  not  occurred, 

3.  Its  phrase  number  (P),  and 

4.  A  Backward  Flag  (b)  to  indicate  a  particular 
manner  in  which  the  string  is  to  be  handled  during  the 
process  of  syntactic  integration. 

In  the  event  that  the  routine  does  not  succeed  in 

determining  a  clause  or  phrase  number,  it  will  insert 
a  Signal  of  Uncertainty  (X),  which  the  routine  in 

Section  B  will  attempt  to  resolve. 

Section  B. 

At  the  conclusion  of  the  preceding  section,  each  source 
occurrence  has  been  replaced  by  a  string  of  informa¬ 
tion  which  will  expand  as  we  progress  in  the  integra¬ 
tion  scheme.  The  string,  at  this  point,  contains  several 
sets  of  data: 

1 .  A  set  of  general  specifications,  GS,  consisting  of 

a.  a  word  flag,  w,  indicating  whether  the  occur¬ 
rence  was  or  was  not  a  Word-utterance  (W). 

b.  a  correspondence  flag,  c,  indicating  whether 
or  not  the  occurrence  (or  its  transform)  was 
located  in  the  storage. 

c.  a  peculiarity  signal,  8,  pointing  out  any  signi¬ 
ficant  feature  of  the  occurrence. 

2.  A  set  of  four  designators,  belonging  to  the  tem¬ 
porary  profile,  TP. 

3.  If  the  occurrence  was  a  W,  its  string  will  have 
in  addition 


a.  a  set  of  temporary  choices,  TCy,  giving  all 
possible  grammar  interpretations  of  the  source 
word. 

b.  a  set  of  target  correspondents,  T,  if  the  word 
(or  its  transform)  has  been  located  in  the 
memory;  otherwise  the  correspondent  will  be 
either 

1)  the  transliteration  of  all,  (or  part)  of  the 
word-utterance,  if  its  pseudo-root  is  not 
listed;  or  else 

2)  the  identification  t,  if  its  transform  is  not 
in  the  glossary. 

c.  a  set  of  Glossary  Predictions  (GP),  retrieved 
from  the  memory  if  such  exist,  each  consisting 
of 

1)  a  Grammar  Essential  (GE),  indicating  the 
predicted  type  of  agreement  with  a  tem¬ 
porary  choice. 

2)  a  Signal  of  Urgency  (i /),  indicating  the 
probability  of  fulfillment. 

3)  In  many  cases,  a  Pretarget  Insert  (PI), 

indicating — in  coded  form — the  English 

word(s)  which  is  (are)  to  precede  the 
target(s). 

In  addition  to  the  above  items,  there  may  be  avail¬ 
able  at  any  stage  of  the  iterative  process  the  following 
information,  which  has  been  generated  during  the  pre¬ 
ceding  portion  of  Section  B. 

1.  Foresight  Predictions  (FP).  Expectations  for 
future  strings,  based  on  past  occurrences;  e.g.  a  direct 
object  is  governed  by  a  transitive  verb.  A  foresight 
prediction  contains  at  least  three  specifications: 

a.  Serial  number,  k,  to  distinguish  the  different 
foresights  generated  by  the  same  string. 

b.  Urgency  Code  (U),  designating  the  degree 
of  necessity — or  the  proximity — of  the  ex¬ 
pected  string,  (e.g.  a  code  of  1  indicates:  next 
occurrence  or  not  at  all). 

c.  Sentence  Element  (SE),  such  as  Subject, 
Predicate,  Complement,  etc. 

In  addition  to  the  above  items,  which  are  always  pres¬ 
ent,  a  foresight  prediction  may  contain  data,  in  the 
form  of 

d.  Morphological  Specifications  (MS)  regarding 
animation,  gender,  number,  etc. 

e.  An  Insert  Flag  ( e )  to  indicate  whether  or  not 
an  English  preposition  is  to  be  inserted  before 
the  target  correspondent,  T. 

2.  Hindsight  (Hj)  regarding  troublesome  strings, 
When  a  Predictable  Choice  does  not  agree  with  any  of 
the  previous  FP,  Hindsight  Entries  about  this  Unex¬ 
pected  Choice  are  stored  together  with  a  Chain  Flag 
(/)  in  Hi,  to  be  considered  with  subsequent  strings. 
Such  apparent  inconsistencies  must  all  be  resolved  at 
the  conclusion  of  the  sentence,  as  a  necessary  (but  not 
sufficient)  criterion  of  successful  syntactical  integra¬ 
tion.  Here,  too,  are  stored  queries  about  strings  whose 
syntax  is  questionable,  even  though  they  seemingly  ful¬ 
fill  previous  predictions.  Entries  in  Hj  concerning  these 
Doubtful  Choices  are  not  flagged. 
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3.  Hindsight  (H2)  regarding  predicted  alternate 
temporary  choices.  It  may  happen  that  more  than  one 
of  the  temporary  choices  TC;  agree  with  previously 
made  predictions.  In  this  case,  one  is  selected  as  a  link 
in  the  sentence  structure  and  the  others  are  stored  for 
future  consideration  in  the  current  (and  subsequent) 
iterations. 

4.  Hindsight  (H3)  regarding  the  remaining  unpre¬ 
dicted  temporary  choices  TC(.  These  are  “pigeonholed” 
for  possible  use  in  subsequent  iterations. 

5.  Chain  number  (L).  Whenever  the  machine,  in 
proceeding  through  a  sentence,  encounters  a  string 
which  it  is  unable  to  link  with  any  previous  predictions, 
it  starts  a  new  Chain.  There  exist,  however,  five  types 
of  Unpredictable  Choices  which  do  not  cause  a  new 
chain  to  be  started.  They  represent  (a)  punctuation 
marks,  (b)  conjunctions,  (c)  adverbs,  (d)  particles, 
and  (e)  prepositions. 

The  Routine  of  Section  B  begins  with  the  following 
steps: 

1.  All  the  hindsight  entries,  left  in  storage  from  the 
previous  sentence,  are  cleared  out. 

2.  The  chain  number  L  is  set  to  1. 

3.  The  following  two  predictions,  for  the  main 
clause,  are  stored  as  foresights: 

CU.SE 

1.7.  Subject 

2. 7.  Predicate 

where  k  is  the  serial  number  within  the  string;  U  is 
the  urgency  code  (7  indicates  the  highest);  and  SE  is 
the  sentence  element  of  the  prediction. 

We  now  attempt  to  determine  the  syntactic  sen¬ 
tence  stiucture  by  observing  the  following  routine  for 
each  string.  (The  letter  q  will  indicate  the  current 
String  number;  Q  will  denote  this  running  coordinate 
as  it  ranges  from  1  to  q\ )  K  and  J  will  denote,  respec¬ 
tively,  the  k  and  j  within  the  string  Q. 

1.  The  routine  examines  the  unfulfilled  FPqK  within 
the  current  clause  or  phrase,  in  decreasing  order  of  Q 
and  increasing  order  of  K.  Each  of  them  is  tested  for 
agreement  with  any  of  the  TC;.  The  first  TC  which 
fits  an  FP  is  taken  as  the  Selected  Choice  (SC)  for  this 
iteration.  The  successful  FP  is  deleted.  If  there  are 
several  TC;  and  none  of  them  fit  any  FPqK,  the  hind¬ 
sight  information  is  examined  for  possible  clues  regard¬ 
ing  the  selection  of  a  TC;  to  act  as  the  SC.  If  no  clue 
is  found,  TCi  becomes  the  SC.  If,  however,  the  string 
was  marked  by  a  backward  flag  b,  the  examination  of 
foresight  predictions  is  omitted.  In  this  case  the  routine 
examines — in  reverse  order — the  previous  selected 
choices,  SC,  for  agreement  with  TCy.  If  the  string  is 
of  the  unpredictable  type,  TCI  is  taken  as  the  SC. 

2.  The  selected  choice  is  indicated  by  Q.K.j.,  where 
Q  is  the  number  of  the  string  where  the  successful  pre¬ 
diction  (if  any)  was  made  and  K  is  the  serial  number 
of  that  prediction.  If  there  is  no  such  prediction  for 
SC,  both  Q  and  K  are  designated  as  0.  The  letter  j,  of 
course,  represents  the  serial  number  of  the  chosen  TC 
in  the  current  string. 

3.  The  chain  number  L  is  left  unchanged,  if  the 


string  has  been  predicted  or  is  of  the  unpredictable 
type;  otherwise  L  is  raised  by  unity. 

4.  The  designators  C,  v,  and  P  of  the  temporary 
profile  TP  are  revised — in  the  light  of  the  SC — to  form 
the  Selected  Profile  (SP).  The  status  flag  v  furnishes 
clues  for  the  subsequent  revision  of  the  clause  number 
C,  and  the  syntactical  integration  determines  the  bounds 
of  each  phrase. 

5.  New  predictions  for  the  foresights  are  culled 
from  three  sources: 

a.  The  temporary  profile,  TP,  of  the  next  string. 
If  the  TP  indicates  that  a  new  clause  is  start¬ 
ing,  the  predictions  of  a  new  subject  and 
predicate  are  entered  as  foresights. 

b.  The  main  routine.  This  may  yield  predictions 
of  a  general  nature  on  the  basis  of  the  SC. 
For  example,  if  the  SC  is  a  noun,  one  such 
prediction  states  that  the  noun  might  be  fol¬ 
lowed  by  a  complement  in  the  genitive  case. 
If  the  SC  is  the  subject,  we  examine  whether 
the  predicate  has  been  found  previously;  if 
not,  we  add  to  the  FP  of  the  predicate  the  in¬ 
formation  that  it  must  agree  with  the  subject 
in  person,  number,  gender,  etc.  Similarly,  if 
the  SC  is  the  predicate,  the  FP  of  the  subject 
— if  unfulfilled — is  amplified. 

c.  The  glossary  predictions,  GP,  accompanying 
the  chosen  TC.  Such  predictions,  if  any,  would 
arise  from  the  peculiar  nature  of  the  original 
occurrence.  For  instance,  a  particular  verb 
may  govern  the  dative  case. 

6.  The  predictions  yielded  by  a  string  are  appraised 
against  the  entries  previously  placed  in  hindsight,  in 
order  to  ascertain  whether  the  former  throw  any  light 
upon  the  difficulties  and  conflicts  represented  by  the 
latter.  If  a  partial  explanation  is  obtained,  a  suitable 
notation  is  made  alongside  the  corresponding  entry. 
Whenever  such  an  entry  is  completely  explained  away, 
it  is  deleted.  If  such  a  deletion  takes  place  in  Hi,  the 
chain  number  L  is  reduced  by  one,  provided  the  entry 
bears  the  chain  flag  /.  Sometimes,  a  rearrangement  in 
order  of  the  strings  is  indicated,  as  a  result  of  the  above 
appraisal. 

7.  The  SC  may  indicate  that  a  key  target  word, 
such  as  a  noun  or  a  verb,  has  not  been  explicitly  stated 
in  the  source  sentence.  If  such  be  the  case,  the  routine 
determines  the  required  Target  Insert  (TI)  and  con¬ 
structs  a  corresponding  New  String.  On  the  other  hand, 
the  SC  may  dictate  the  suppression  of  (a)  target  corre- 
spondent(s). 

8.  A  target  order  number  R  is  assigned  to  the  string, 

to  indicate  the  arrangement  of  occurrences  in  the  target 

language.  In  general,  the  R’s  are  consecutive.  If,  how¬ 
ever,  the  appraisal  in  Step  6  calls  for  a  rearrangement 
of  strings,  or  if  Step  7  resulted  in  the  insertion  of  a  new 
string  (or  the  suppression  of  an  Old  String) — the  af¬ 
fected  R's  are  renumbered  in  accordance  with  the  de¬ 

sired  sequence.  Pretarget  Inserts  (PI),  such  as  prepo¬ 
sitions  and  articles,  are  not  assigned  an  R.  Their  han¬ 
dling  will  be  discussed  in  Section  C. 
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9.  The  TC,  which  do  not  become  the  SC  may,  un¬ 
der  certain  circumstances,  be  disregarded.  In  the  cases 

where  the  routine  directs  the  machine  to  retain  them, 
they  are  entered  into  hindsight  H2  or  H3,  according  to 
whether  they  do  or  do  not  agree  with  any  FP. 

10.  If  the  chain  number  L  was  raised  in  Step  5,  an 
appropriate  query  is  entered  into  hindsight  Hj  with  a 
chain  flag  /.  If  the  SC  is  a  doubtful  choice,  suitable 
queries — unaccompanied  by  the  chain  flag — are  also 
entered  into  Hj. 

When  the  end  of  the  sentence  is  reached,  we  need 

not  embark  upon  another  iteration  if  (1)  the  foresights 

do  not  contain  unfulfilled  predictions  of  urgency  6  and 
7,  and  (2)  the  chain  number  is  1.  (In  that  case  H[ 

should  be  clear  of  flagged  entries.) 

In  this  event,  the  selected  choices  for  all  strings  are 
considered  as  Final  Choices  (FC)  and  the  routine  pro¬ 
ceeds  to  Section  C.  If  however,  another  iteration  is  in¬ 
dicated,  it  investigates  the  H2  information  where  reso¬ 
lution  signals  were  placed  during  the  previous  iteration 
whenever  some  partial  light  was  thrown  upon  any  of 

its  entries.  As  a  result,  one  of  the  former  selected  choices 
is  replaced  by  a  more  promising  one,  and  the  effect  of 
that  change  is  investigated.  It  is  obvious  that,  if  the 
number  of  unresolved  entries  in  FF  is  high,  it  would 
be  prohibitive  to  pursue  all  the  possible  combinations 
of  selected  choices.  We  therefore  set  a  limit  to  the 
number  of  iterations  we  allow  the  machine  to  execute. 
In  the  unlikely  event  that  all  the  possibilities  inherent 
in  the  H2  entries  have  been  exhausted,  the  H3  entries 
are  attacked  in  the  same  manner. 

Failure  is  conceded  when  the  number  of  iterations 

already  performed  has  reached  the  limit  we  had  set 

for  ourselves,  or  when  the  current  set  of  selected  choices 
repeats  any  of  the  previous  sets  (which  are  stored  in 
the  internal  memory).  In  that  case,  the  routine  records 
a  failure  signal  and  indications  of  the  types  of  errors 
encountered,  to  be  printed  out  at  the  conclusion  of 
Section  C. 

Section  C. 

This  section  is  devoted  to  the  construction  and  printing 
of  the  target  sentence. 

1 .  The  target  correspondents  listed  with  the  final 
choices  are  arranged  in  the  sequence  given  by  R. 

2.  A  subroutine  supplies  new  pretarget  inserts  PI. 
in  addition  to  those  supplied  by  the  foresights.  These 
may  be  either  English  articles  or  prepositions.  The  set 


of  PI  (if  any)  are  inserted  in  front  of  the  proper  cor¬ 
respondent  for  eventual  printout. 

3.  A  second  subroutine  affixes  Pidgin  Endings  (E) 
to  target  correspondents  whenever  needed.  (To  con¬ 
serve  precious  internal  space,  we  regard — for  the  pres¬ 
ent — all  English  targets  as  grammatically  regular.  Thus 
the  plural  of  foot  will  appear  as  foot-s.) 

4.  A  count  is  made  of  all  unresolved  hindsight  en¬ 
tries. 

5.  The  resulting  information  is  printed  out.  All  in¬ 
serts,  whether  PI  or  TI  are  printed  in  parentheses. 
Words  for  which  there  are  no  target  correspondents 
are  enclosed  in  brackets.  They  may  appear  as  some 
combination  of  the  following  word-sections: 

a.  a  translated  initial  prefix 

b.  a  transliterated  full  or  partial  stem 

c.  a  transliterated  full  or  partial  word. 

If  the  iterative  routine  failed  to  satisfy  our  criteria,  this 
fact  would  be  indicated  by  the  failure  signal  and  by 
the  notations  of  the  error  types  encountered.  On  the 
other  hand,  the  satisfaction  of  the  criteria  is  no  guar¬ 
antee  that  the  result  is  a  faithful  translation,  unless  all 
three  hindsights  are  clear  and  all  occurrences  are 
monosemantic.  Since  such  eventualities  will  be  ex¬ 
tremely  rare,  we  shall  regard  the  tallies  for  the  hindsight 
entries  and  the  multiplicity  of  the  printed  meanings  as 
a  measure  of  the  “goodness  of  fit”  of  our  version. 

ILLUSTRATION  3. 

The  chart  given  on  the  next  pages  outlines  the  syntac¬ 
tic  integration  of  a  sentence  possessing  the  five  types 
of  difficulty  which  our  routine  is  able  to  handle  with 
some  degree  of  success.  On  the  other  hand,  it  contains 
a  number  of  polysemantic  words,  of  which  only  a  few 
can  be  resolved  at  present.  For  the  remaining  poly¬ 
semantic  words,  we  are  forced  to  print  out  all  the 
meanings  contained  in  our  glossary. 

The  chart  incorporates  all  of  the  steps  entailed  in 
carrying  out  the  first  (major)  iteration  cycle  involving 
the  entire  sentence.  The  reader  may  need  guidance  as 
regards  the  temporal  sequence  of  these  steps;  we  shall, 
therefore,  review  this  sequence  from  the  start  of  the 
process  on  through  the  handling  of  the  first  String  of 
the  sentence.  The  Notes  following  the  chart  are  de¬ 
signed  to  clarify  situations  which  do  not  come  up  in 
String  1.  The  two  Lists  appended  to  this  report  will 
furnish  all  pertinent  definitions.  Ail  terms  mentioned 
therein  are  capitalized  in  the  material  which  follows. 
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1.  The  portions  headed  “Part  I”  and  “T”  list  the  mate¬ 
rial  obtained  prior  to  the  initiation  of  the  syntactic  process. 


PART  I 

Source 

Occurt* 

Gen. 

Spcfcs. 

Temp.  Chs,  (TC  j  ) 

Gfss  Preds. 

PS 

Morph.  Spcfcs. 

IGP) 

IGS) 

(MS) 

GE  a  u 

j_ 

Ps|  A  I  D  I  ts  I G  I  Y  I  Z 

g  |  o  |  d  |  t  |  x  |  m 

t 

Ho- 
*aa- 
a- 
T b 

1  i  Cap 

_ 

l 

Vb  if 

7  111 

vi 

Heading 

Explanation 

Occurrence 

This  portion,  although  overwritten  in  Fart 
II,  is  retained  here  for  the  convenience  of  the 
reader.  It  indicates,  on  separate  lines,  the  1 ) 
Pseudo-prefixes,  2)  Pseudo-roots,  3)  Pseudo¬ 
suffixes,  and  4)  Ending  of  a  Word- utterance 
which  is  listed  in  the  Glossary. 

<P 

This  Flag  is  not  part  of  the  routine.  It  is 
introduced  for  the  convenience  of  the  reader 
to  indicate  a  Word-utterance  which  is  found 
in  the  Special  List. 

GS 

The  General  Specifications  of  this  Occur¬ 
rence  indicate  that  1)  it  is  a  Word- utterance, 
2)  its  correspondent  (T)  is  an  English  word 
retrieved  from  storage,  and  3)  it  starts  with  a 
capital  letter.  ( Cf.  the  headings  in  the  List 
of  Symbols). 

TC,. 7 

The  index  j  remains  at  1;  this  indicates 
that  the  given  Occurrence  has  given  rise  to 
only  one  Temporary  Choice. 

TC,;MS 

The  Morphological  Specifications  indicate 
that  the  Occurrence  represents  a  verb  (Vb) 
in  die  infinitive  mood  (if). 

CP 

The  Glossary  Predictions  accompany  the 
Stem  of  the  Occurrence  in  Storage.  The  num¬ 
bers  in  the  various  columns  are  Urgency 
Signals  («).  Each  of  these,  with  the  excep¬ 
tion  of  the  number  headed  by  x,  will  give 
rise  to  Foresight  Predictions  to  be  stored  in 
String  2.  Their  connotation  is  explained  in 
the  List  of  Terms,  and  will  be  clarified  at  the 
time  they  will  be  utilized. 

PS 

The  Profile  Skeleton,  although  overwritten 
in  Part  II,  is  kept  in  the  chart  for  the  con¬ 
venience  of  the  reader.  It  serves  as  a  basis 
for  determining  the  boundaries  of  the  clauses 
and  phrases  within  the  source  sentence. 

Heading 

Explanation 

Section  C 

T 

In  the  Glossary,  English  correspondents 
arc  stored  in  compact  form  somewhat  similar 

to  the  Source  Transform.  The  decomposition 
is  shown  here. 

2,  The  portion  headed  “TP”  lists  the  components  of  the 
Temporary  Profile,  resulting  from  the  iterative  process  per¬ 
formed  on  the  data  in  the  Profile  Skeleton  (PS)  involving 
the  entire  sentence. 


« 

PARTE 

|Sec. 

X] 

TP 

<? 

E 

v| 

p 

r 

1 

1  00 

Heading  , 

Explanation 

C 

The  number  1  indicates  that  the  given  Oc¬ 

currence  forms  part  of  the  first  clause. 

u 

This  flag  remains  at  the  zero  level  until  the 

Prediction  of  a  predicate  is  fulfilled. 

P 

The  zero  indicates  that  the  given  Oc- 

!  currence  is  not  part  of  a  phrase. 

3.  The  following  preliminary  steps  are  executed  by  the 
routine  prior  to  entering  the  (minor)  iterative  cycle  in¬ 
volving  the  first  String, 

a.  The  Chain  number  ( L)  is  set  to  I. 

b.  The  Hindsight  is  cleared  of  all  content. 

c.  The  Foresight  Predictions  ( FP„* )  stored  within  String 
q  are  made  by  String  q- 1.  When  q  is  equal  to  1,  FI\., 
and  FPi.s  are  made  by  the  routine  in  response  to 
signals  indicating  the  start  of  the  first  clause.  They 
predict  with  utmost  assurance  (shown  by  U  =7) 
that  a  Subject  and  Predicate  wilt  occur  within  the 
clause. 

d.  The  Selected  Profile  ( SP )  is  temporarily  set  equal  to 
the  Temporary  Profile. 
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4.  The  portion  headed  “Section  B”  indicates  how  the 
routine  attempts  to  incorporate  the  given  Occurrence  as  a 
Jink  of  a  unifying  Chain,  representing  the  sentence  struc¬ 
ture. 


part  n 


Section  B 


Fores'l  Prds.  (FPq|<) 


SE 


Morph.  Sps. 
(MS) 


V  A  G  Y  Z 


Sel.  Ch. 
(SC) 


FP 


OK 


SP 


CMP 


H  i  n  d  s  i  g  h  t 


H, 

h2 

Entry  |  Reso  1 . 

Entry 

Res 

DDI3DQDQ 

□□Oia 

□□ 

SE  Sbj  -  6  1 

-  8  2 

1  Y  m  -  4  3 
-  10  3 
-26  l 

H, 


l®l 


1  7  Sbj 

2  7  Pdc 


I  I  I 


I  0  0 


1 


7  Pdc 
7  CpI 
6  CIs 
3  CpI 
3  CpI 


vf  N 


Heading 


SC:FP 

SC:/ 


Explanation 


The  Q  and  K  under  this  heading  indicate 
that  FPl.i,  calling  for  a  Subject,  has  been  re¬ 
sponsible  for  the  Selected  Choice. 

The  index  1  indicates  that  TC, — a  verb 
in  the  infinitive — fulfilling  the  above  predic¬ 
tion,  was  selected. 


Heading 


Hindsight 

Resol. 


Explanation 


In  general,  the  TC,,  of  the  Current  String, 
and  the  newly  made  FP,«  j,,  stored  in  the 
next  String,  are  examined  to  ascertain 
whether  they  throw  any  light  on  the  perplexi¬ 
ties,  doubts,  and  conflicts  recorded  in  the 
Hindsight  columns.  For  q  —  I,  however,  the 
Hindsight  is  empty  at  this  stage  of  the  proc¬ 
ess.  ( The  resolutions  indicated  on  the  chart 
for  String  1  will,  of  course,  have  been  made 
by  subsequent  Strings.) 


L 


FP 


J2.k 


The  Chain  number  remains  unchanged  be¬ 
cause  the  SC  fulfilled  a  Foresight  Prediction. 

This  signal  is  not  part  of  the  routine.  It  is 
introduced  for  the  convenience  of  the  reader 
to  indicate  the  number  of  the  String  which 
caused  the  deletion  of  the  F’oresight  Predic¬ 
tion  alongside  of  which  the  t  appears. 

The  r  accompanying  FP,  s  is  encircled  to 
indicate  (to  the  reader  only)  that  the  pre¬ 
diction  is  amplified  within  String  2  in  the 
light  of  the  chosen  Subject. 

Foresight  Predictions,  to  be  stored  in  String 
2,  are  made  on  the  basis  of  the  Selected 
Choice  in  String  I.  First,  and  most  urgent,  is 
the  amplification  of  the  Prediction  of  a 
predicate.  There  were  no  general  grammar 
Predictions  because  in  our  routine  a  verb 
does  not  yield  such. 

The  remaining  four  Predictions  came  from 
the  Glossary  information,  accompanying  the 
Temporary  Choice  selected  as  SC  for  String 
1.  They  predict  that  with  utmost  necessity  a 
direct  object  (in  the  form  of  either  a  single 
word  in  the  accusative  case  or  of  a  clause) 
will  occur  and  that  a  slight  probability  exists 
for  the  appearance  of  complements  (i.e.  in¬ 
direct  objects)  in  the  dative  and/or  instru¬ 
mental  case,  each  followed  by  a  suitable 
preposition  (e  Flag  =  1). 

Since  the  x  Flag  in  String  1  is  unity,  a 
suitable  signal  will  be  set  to  indicate  that  the 
next  Positional  Preposition  will  govern  the 
locative  case,  unless  the  signal  is  turned  off 
by  an  x  equal  to  zero  in  a  subsequent  String. 


SP 


Hindsight 

Entries 


H,  Entries 


Hs  Entries 


Ha  Entries 


In  general,  the  doubts  recorded  in  the 
selection  of  the  components  of  the  Profile  are 
resolved  in  fight  of  the  SC  and  of  the  Hind¬ 
sight  resolution.  For  q  —  I,  this  is  seldom 
necessary. 

This  information  is  derived  by,  and  stored 
in,  String  q  to  be  investigated  by  subsequent 
Strings. 

The  unflagged  Entries  in  String  1  indicate 
that  the  SC  was  a  Doubtful  Choice.  They  are 
to  be  interpreted  as  follows: 

First  Entry 

a.  The  j  is  blank  to  indicate  that  the  Entry 
concerns  the  SC. 

b.  The  SE  indicates  that  this  part  of  the 
SC  contains  the  doubtful  item. 

c.  The  Sbj  gives  the  specific  Element  of 
the  SC  which  is  doubtful. 

Second  Entry 

a.  The  /  indicates  that  TCi  may  be  gov¬ 
erned,  not  by  FPj.i  as  chosen,  but  by 
some  future  FP. 

b.  The  Y  indicates  the  part  of  some  future 
FP  which  may  govern  TCi. 

c.  The  m  indicates  that  TCi  may  be  the 
fulfillment  of  a  Prediction  for  a  verb  in 
the  infinitive  mood. 

The  one  Temporary  Choice  in  String  1, 
necessarily  taken  as  the  SC,  fulfilled  (doubt¬ 
fully)  Fi.i  but  not  F12.  Therefore,  H2  has  no 
Entry  for  String  1, 

When  a  String  has  only  one  TC,  there  can 
be  no  unused  Choices. 
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5.  The  portion  headed  "Section  C”  indicates  the  order 
and  form  in  which  the  target  correspondent  will  appear  in 
the  translated  sentence. 


_ Section  C _ 

Tronslotiort 


Heading 


The  order  in  which  the  correspondents  will 
appear  in  the  printed  translation  in  general 
follows  closely  the  order  of  the  original  Oc¬ 
currences.  On  occasion,  a  particular  Selected 
Choice  ( SC )  and/ or  a  Hindsight  resolution 
may  effect  a  deviation  from  the  sequence. 
The  routine  in  Section  B  indicates  the  proper 
order  (R)  of  every  correspondent  in  the 
printed  sequence. 

A  subroutine  makes  these  crude  Pretarget 
Inserts.  They  are  printed  in  parentheses. 

The  Pidgin  Endings  assume  all  English 
words  to  be  regular. 


PART  n 


WTryn 

Section  B 

Section  C 

Preds. 

3P) 

1 

SP 

H  i  n  d  s  i  g  h  1 

Translation 

.  H' . . 

Hz 

Entry 

Resol. 

EntfY 

IIIDI3 

-*DBB 

□□BO  J 

□0 

BOB 

DDIO 

QHBfl 

□□Bra 

HE 

jn 

R 

PI  ]  T 

B 

i  i 

vi  :  o 

i 

1  7  Sbj 

2  7  Pete 

■ 

T 

1  0  0 

SE  Sbj  -  6 

,  -  9  „  2 

1  Y  m  -  4  3 

-  10  3 

-26  ! 

■ 

1 

(To)  1  de¬ 

mon  - 
!  str- 

;  ofe 

1 

1  1 

po  :  i 

1 

24 

9 

I  7  Pdc 

I I  S»i 

4  3  CpI 

5  3  Cel 

m 

1 

2 

3 

?  with- 

*  out 

1 

1  7  CpI 

■ 

1 

(the) 

means 

:  x 

i 

H 

4  2  1  SE  Mst  +  5 

2  2  2  A  im 

Y  o  -  6 

4 

(of) 

[Irans- 

tzend- 

ent] 

1 

:  x 

5 

5 

1  1  Mst 

2  0  CpI 

©  q  s 

9 

1 

5  i  1 

1 

1  0  1 

5 

[onol- 

izj 

i 

:  x 

6 

6 

r  i  cpi 

2  0  Mfr 

9 

Mgs 

1 

2  2  2 

i 

1  0  0 

(the) 

bas¬ 

ic 

:  x 

7 

7 

1  1  Mst 

2  0  Cpi 

im  Op 

g 

1 

7  1  3 

i 

1  0  0 

[teor- 

em] 

i  t 

:  x 

8 

a 

1  1  Cpi 

2  0  Mfr 

9 

im  F  o  p 

1 

8  1  1 

1 

1  0  0 

3 

(of) 

calcul¬ 

ation/ 

us 

i  i 

;  x 

9 

9 

25 

1  1  Cpi 

2  0  Mfr 

3  3  Cpi 

g 

Im  N  g  s 

i 

' 

9  1  1 

1 

1  0  0 

C»  ;  x 

10 

10 

26 

1  1  Cpi 

2  0  Mfr 

3  3  Cpi 

9 

im  F  g  p 
m 

1 

0  0  1 

1 

1  0  0 

Ps  C]  *  12 

2  Ps  Av  +  12 

c*  3  x 

0  0  1 

i 

1  0  0 

2  X  1 

1  Mst 
0  Cpi 

2  2  2  6 

i 

1  0  0 

0  X 

13 

13 

1 

2 

im  o  p 

9 

1 

13  1  3 

t 

i  0  0 

ap¬ 

plic¬ 

ation 

0  X 

14 

[4 

1  1  Cpi 

2  0  Mfr 

g 

im  N  o  p 

1 

- 

14  1  1 

1 

1  0  0 

1 

(of) 

them 

1 

continued  on  next  page 
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Note 

no. 

9 

Headings 

Explanation 

1 

2 

3 

2 

TP 

FPi.» 

SC 

The  preposition  starts  a  new 
phrase,  as  indicated  by  P  =  1. 

The  first  Prediction  is  FPi.t 
revised  in  the  light  of  the  chosen 
Subject.  The  number  3  in  the  Y 
column  indicates  the  3rd  person. 
The  remaining  FP  were  culled 
from  GP  in  String  1.  The  Flag  e 
indicates  that  a  crude  PI  will  be 
inserted  before  a  T  which  fulfills 
a  predicted  dative  or  instrumental 
Complement 

The  Q  and  K  are  zero,  because 
a  preposition  is  an  Unpredictable 
Choice. 

4 

3 

TG) 

The  j  is  set  to  zero  to  indicate 
that  the  information  will  be  com¬ 
mon  to  all  the  TC,  listed  in  String 
3. 

The  Grammar  Essential  g  is 
not  used  for  nouns  or  adjectives, 
since  the  routine  always  makes 
the  prediction  of  a  Complement 
in  the  genitive  for  these  parts  of 
speech.  In  Russian,  adjectives  may 
act  as  nouns. 

When  the  Complement  of  a 
preposition  is  fulfilled,  the  other 
TC>  are  disregarded  and  not  en¬ 
tered  in  Hindsight. 

5 

6 

GP 

Hi,  H, 

7 

8 

9 

10 

4 

Occurrence 

TC,:G 

TP  and  SP 

FP 

Tire  prefix  and  Erst  suffix  are 
non-Slavic. 

The  encircled  F  means  Mascu¬ 
line  or  Neuter. 

The  routine  resolves  the  Un¬ 
certainty  Signal  X,  in  TP,  con¬ 
cerning  the  phrase  number  P. 

The  first  two  Predictions  are 
made  by  the  routine  anent  the 
noun  in  the  preceding  String. 
They  were  deleted  simultaneously 
because,  whether  fulfilled  or  not, 
FP’s  with  Urgency  1  or  0  are  not 
kept  beyond  one  String.  (Cf.  Ur¬ 
gency  code  in  List  of  Terms).  The 
third  FP  throws  some  light  on  the 
second  query  in  Hi  but  does  not 
resolve  it,  as  indicated  by  the 
negative  sign  in  h. 

11 

Hi 

Two  alternate  choices  are  noted 
here: 

First  conflict. 

a.  The  SE  indicates  the  part  of 
some  future  SC  which  may 
throw  light  on  the  conflict. 

b.  The  Mst  indicates  the  fact 
that  the  conflict  will  be  re¬ 
solved,  if  a  future  SC  will 
supply  a  Master  for  the  cur¬ 
rent  SC. 

Second  conflict. 

First  Entry 

a.  The  A  indicates  the  part  in  a 
future  SC  which  may  throw 
light  on  the  conflict. 

Note 

no. 

9 

Headings 

Explanation 

12 

E 

b.  The  tm  indicates  the  fact 
that  the  conflict  will  be  re¬ 
solved,  if  a  future  SC  will 
supply  an  inanimate  Master 
for  the  current  SC, 

Second  Entry. 

a.  The  Y  indicates  the  part  in 
a  future  SC  which  may 
throw  light  on  the  conflict. 

b.  The  a  indicates  the  fact  that 
the  conflict  may  be  resolved, 
if  no  future  SC  fulfills  the 
e-xpectation  of  a  Complement 
in  the  accusative  case. 

This  is  the  Pidgin  Ending  for 
all  non- Slavic  adjectives. 

13 

14 

15 

16 

5 

TC:C 

FP 

SC  and 

IT  resol. 

Ha  Entry 

The  Pseudo-suffix  H3  with  a 
non-Slavic  root  will  be  taken  as 
denoting  a  masculine  noun.  Any 
required  non-Slavic  feminine  or 
neuter  noun  with  this  Pseudo-suf¬ 
fix  will  be  included  in  the  Glos¬ 
sary. 

These  Predictions  are  supplied 
by  the  routine  anent  the  adjective 
in  the  preceding  String. 

The  Master  prediction  was  sat¬ 
isfied,  thus  resolving  the  first 
Entry  in  Ha  (Cf.  Note  11).  Un¬ 
fortunately  the  Occurrence  was  a 
non-Slavic  word  and  not  repre¬ 
sented  in  our  Glossary.  There  was 
no  information  about  its  anima¬ 
tion,  and  therefore  the  second  En¬ 
try  can  not  be  resolved. 

The  Entry  of  the  form  5.2.1,  to 
indicate  that  FP»-  is  also  satisfied 
by  TCi,  is  omitted  for  the  follow¬ 
ing  reason:  When  a  Master  pre¬ 
diction  is  satisfied,  there  is  at 
present  no  way  to  resolve  this 
ambiguity,  since  considerations  of 
semantics  and  of  context  would 
be  involved. 

17 

18 

19 

6 

SP  and  FP»  a 

TCi  and 

Hi  resol. 

SC  and  H* 

Since  none  of  the  Predictions 
made  by  the  Strings  in  phrase  1 
are  fulfilled  by  the  current  String, 
whereas  a  Prediction  in  the  main 
clause  is,  the  phrase  number  P 
is  set  to  zero  again.  The  unfulfilled 
Prediction  in  Phrase  1,  namely 
FP«,«,  which  bears  a  low  U,  is  de¬ 
leted. 

Since  TCt  could  serve  as  Sub¬ 
ject,  the  fact  is  noted  in  the  reso¬ 
lution  column  of  the  first  Entry  in 
Hindsight  Hi  (in  String  1). 

The  SC  throws  partial  light  on 
the  conflict  in  the  Hi  column. 
(Cf.  Note  11.) 

20 

7 

TC:G 

The  suffix  EM  with  a  non-Slavic 
root  will  be  considered  as  belong¬ 
ing  to  feminine  nouns.  (Cf.  Note 
13.) 
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Note 

no. 

<? 

Headings 

Explanation 

21 

TC  and 

H.  resol. 

Although  TC:  could  serve  as 
the  Subject,  no  note  is  made  of 
this  fact  in  the  resolution  of  the 
first  Entry  in  Hi,  because  it  would 
be  the  Master  of  TC,  in  the  previ¬ 
ous  String,  which  has  already 
been  recorded.  (Cf.  Note  18.) 

22 

8 

TC/  and 

Hi  rcsol. 

Partial  light  is  thrown  on  the 
first  Entry  in  Hi. 

23 

TCi  and  H, 

This  choice  is  pigeonholed. 

24 

T 

There  is  no  way  to  resolve  the 
polysemy  at  the  present  stage  of 
our  investigation. 

25 

9 

EP« » 

This  prediction  deletes  F1V»,  as 
explained  in  the  List  of  Terms 
under  the  heading  "Urgency 
Code”, 

26 

10 

_ 

TC  and  H, 

The  iterative  scheme  in  Section 
A  of  Part  II  established  the  fact 
that  this  String  did  not  start  a  new 
clause  (since  the  old  clause  still 
had  the  Status  zero).  The  current 
String  could  therefore  represent 
cither  a  coordinative  conjunction 
binding  two  related  SC,  or  an  ad¬ 
verb.  Since  both  Choices  are  Un¬ 
predictable,  they  arc  recorded  as 
Doubtful  (unfiagged  Entries  in 
Ht). 

27 

This  String  represents  an  Idiom, 
which  constitutes  a  single  Word- 
utterance. 

28 

12 

b 

The  Backward  Flag  was  placed 
by  the  Profile  routine  in  Section 
A  of  Part  11.  It  indicates  that,  in¬ 
stead  of  examining  the  FP's,  the 
routine  will  scan  the  previous  SC 
in  reverse  order  to  establish  a 
new  link  in  the  structural  Chain. 

29 

SC;B 

The  entry  in  this  column  indi¬ 
cates  that  the  SC  in  String  6  is 
conjunctively  related  to  the  cur¬ 
rent  String,  and  the  true  nature  of 
String  10  is  thus  established,  as 
indicated  hy  the  resolutions  in  Hi. 

30 

14 

GS:6 

The  source  word  might  under 
certain  conditions  he  treated  like 
a  prepositive  possessive  modifier 
rendered  in  English  bv  "their". 

31 

TC:D 

Thi?  number  3  indicates  the 
person. 

32 

16 

FP 

These  Predictions  were  caused 
by  the  change  in  the  clause  num¬ 
ber  C  of  the  previous  String. 

33 

b;  B;  H, 

Similar  entries  have  been  ex¬ 
plained  in  Notes  28.  29,  and  for 
String  1. 
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Note 

no. 

'1 

Headings 

Explanation 

34  : 

23 

TP:U 

The  Profile  routine  indicated 
that  the  current  String  must  be  fol¬ 
lowed  by  a  New  String  represent¬ 
ing  a  copulative  verb.  The  Status 
Flag  e  was  set  to  l  in  order  to 
effect  the  proper  insertion.  (Cf. 
Note  35.) 

35 

24 

GS:6 

This  String  was  inserted  be¬ 
cause  of  the  v  Flag  iu  the  previ¬ 
ous  String  (Cf.  Note  34.) 

36 

25 

FP 

These  Predictions  were  made 
hy  the  routine  anent  the  copula¬ 
tive  verb  in  the  preceding  String. 

37 

27 

TC, 

The  locative  case  cannot  be 
considered,  since  it  is  governed 
only  by  a  preposition. 

38 

I. 

The  Chain  is  broken,  as  the 
routine  cannot  account  for  the 
dative.  (The  FP/,,  cannot  be 
considered,  since  the  two  Strings 
are  not  in  the  same  phrase.) 

39 

»,:/ 

The  Entry  is  flagged,  since  the 
Choice  is  Unexpected. 

40 

R 

The  rearrangement  in  the  target 
order  will  be  explained  in  Note 
47. 

41 

28 

b,  B,  R 

Cf.  Notes  28,  29,  47. 

42 

29 

i  FP*  , 

This  prediction  throws  partial 
light  on  the  H,  Entry  in  String 
27.  The  r  is  indicated  in  the  light 
of  Note  47. 

43 

PI 

Since  the  noun  is  animate,  only 
one  connotation  of  the  preposition 
is  used. 

44 

30 

CS:6  and 
TC:A 

This  part  is  used  in  obtaining 
the  signal  pic  ( proper  word ) . 

45 

31 

FI’ 

The  appearance  of  an  initial 
calls  strongly  for  another  proper 
noun,  flence  the  inversion  of  the 
usual  order  of  Predictions  after  a 
noun. 

46 

CS;6  and 
TC:A 

The  fact  that  the  capitalized 
word  does  not  start  a  sentence 
causes  A  to  become  pw. 

47 

Since  no  other  explanation  of¬ 
fered  itself  for  the  H,  resolution 
in  27,  the  previous  explanation  is 
accepted,  tire  Entry  is  deleted, 
and  L  is  reduced  by  one.  This  fact 
also  causes  the  rearrangement  of 
the  two  neighboring  R's.  The  Ts 
for  these  two  strings  reflect  the 
result  of  this  rearrangement. 

End  of  Iteration  1.  Since  no  FP’s  with  U  of  6  or  7  remain 
and  the  chain  number  L  is  equal  to  1,  no  other  itera¬ 
tion  is  necessary.  The  translation  is  printed  out  as  indi¬ 
cated  in  the  last  three  columns  of  the  chart,  followed 
by  the  tally  of  unresolved  Hindsight  Entries. 
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Appendix  I.  List  of  Terms 

address,  extended  (a).  The  locator 
of  an  item  on  an  external  storage 
medium.  Its  form  depends  on 
the  machine  used.  On  the  IBM 
704  it  consists  of  ( 1 )  a  num¬ 
ber,  ( 2 )  block  number,  (3) 
serial  position  of  item  in  the 
block. 

chain.  A  group  of  consecutive  SC’s 
characterized  by  the  same  chain 
number  (L). 

CHOICE, 

doubtful.  A  Selected  Choice  which 
entails  the  recording  of  an  un¬ 
flagged  query  in  Hindsight  H,. 
final  (FC).  The  Selected  Choice  in 
the  last  iterative  cycle. 
predictable.  Not  belonging  to  class 
of  Unpredictable  Choices. 
selected  (SC).  A  Temporary  Choice 
selected  as  a  link  in  the  Chain 
during  a  current  iteration. 
temporary  (TC).  A  grammatical 
interpretation  of  a  Source  Oc¬ 
currence. 

unexpected.  A  Predictable  Choice 
which  does  not  agree  with  any 
of  the  Foresight  Predictions  (FP). 
unpredictable.  A  TC  containing  one 
of  the  following  parts  of  speech; 
( 1 )  a  conjunction  ( 2 )  an  adverb 
(3)  a  particle,  and  (4)  a  prepo¬ 
sition;  or  else  (5)  it  is  a  punc¬ 
tuation  mark. 

code,  urgency.  Cf.  Urgency. 

element,  sentence.  Cf.  Sentence. 

ENDrNG, 

source.  The  true  inflectional  ending 
of  a  word  in  the  source  text. 
pidgin  (E).  Regular  ending  af¬ 
fixed  to  stein  of  target  corre¬ 
spondent,  regardless  of  correct 
usage. 

ENTRY, 

glossary.  A  complete  set  of  Glos¬ 
sary  items  corresponding  to  a 
source  Stem. 

hindsight.  A  record  of  an  Unex¬ 
pected  String,  a  Doubtful  Choice, 
or  a  surplus  Temporary  Choice 
(cf.  Hindsight). 

essential,  grammar  (GE).  A  gram¬ 
matical  form  called  for  by  a 
glossary  Prediction  (e.g.  an  ac¬ 
cusative  called  for  by  a  transi¬ 
tive  verb).  Each  type  of  GE  has 
a  separate  location  in  the  string 
reserved  for  it.  A  GE  is  predicted 
by  storing  an  Urgency  Signal  in 
this  location. 

extended  Address.  Cf,  Address. 

file,  sorting.  The  internal  space  al¬ 
lotted  for  sorting  the  Extended 
Addresses  (a). 


flag.  A  binary  digit  (i.e.  either  a  0- 
or  1-bit). 

Backward  (h).  A  1-bit  alerting  the 
machine  to  examine  the  forego¬ 
ing  Selected  Choices  in  order  to 
establish  the  linkage  of  the  cur¬ 
rent  String. 

Chnin  (f).  A  1-bit  accompanying 
a  Hindsight  Entry  in  H,  to  re¬ 
cord  an  Unexpected  Choice. 

Correspondence  (c).  A  bit  indicat¬ 
ing  whether  the  String  contains 
target  correspondents  or  not. 

Insert  (e).  A  1-bit  accompanying 
a  Foresight  Prediction  to  indi¬ 
cate  that  a  suitable  English  pre¬ 
position  is  to  be  used  as  a  Pre¬ 
target  Insert  (PI). 

Locative  (x).  A  1-bit  in  Glossary 
Predictions  to  indicate  that  the 
locative  case  is,  1 )  if  the  String 
represents  a  preposition,  one  of 
the  cases  governed  by  the  pre¬ 
position;  or  2)  if  the  String  does 
not  represent  a  preposition,  to  be 
used  after  the  next  Positional 
Preposition  encountered  in  the 
sentence, 

Status  (v).  A  bit  indicating  whether 
the  predicate  of  the  current 
clause  has  turned  up  or  not. 

Word  ( u; ) .  A  hit  indicating  whe¬ 
ther  the  String  represents  a 
word  or  not. 

general  Specifications.  Cf.  Specifi¬ 
cations. 

c  lossary.  The  externally  stored 
source-to-targct  dictionary  used 
by  the  MT  group  at  NBS.  It  is 
stored  in  a  greatly  compacted 
form  and  contains  diacritical 
material  designed  to  aid  in  the 
syntactic — and  to  a  small  degree 
in  the  semantic — analysis  of 
source  sentences. 

grammar  Essential,  Cf.  Essential. 

hindsight.  Internal  space  allotted  for 
storing  in 

Hi,  Entries  concerning  Unexpected 
Choices  or  Doubtful  Choices. 

IT,  Temporary  Choices  (TC), 
other  than  the  Selected  one, 
which  fulfill  Foresight  Predic¬ 
tions  ( FP ) . 

H:,,  Temporary  Choices  (TC), 
other  than  the  Selected  one, 
which  do  not  fulfill  any  of  the 
Foresight  Predict  ions  (FP). 

identification  Tag.  Cf.  Tag. 

INSERT, 

target  (TI).  A  target  correspond¬ 
ent  incorporated  in  a  New 
String. 


pretarget  (PI).  A  word  inserted 
before  a  target  correspondent, 
list.  Internally  stored  one-to-one  cor¬ 
respondences,  yielding  for  each 
of  the 

endings,  an  address  f)  for  each 
Source  Ending,  enabling  the  ma¬ 
chine  to  find,  subsequently,  the 
corresponding  morphological  in¬ 
formation  in  the  Table  of  End¬ 
ings. 

pseudo-prefixes,  a  6-bit  character, 
as  a  substitute  for  each  Pseudo¬ 
prefix. 

pseudo-roots,  an  Extended  Address 
a  leading  to  the  location  of  the 
first  externally  stored  Satellite  of 
each  Pseudo-root. 

pseudo-suffixes,  a  6-bit  character, 
as  a  substitute  for  each  pseudo- 
suffix. 

symbols,  a  definition  (to  be  found 
in  Appendix  11). 

terms,  a  definition  (to  be  found  in 
Appendix  I). 

special  words,  dictionary  informa¬ 
tion,  The  arguments  in  this  List 
consist  of  conjunctions,  preposi¬ 
tions,  particles,  idioms,  abbrevia¬ 
tions,  some  adverbs,  and  words 
with  ambiguous  endings. 
morphological  specifications.  Cf. 
Specifications. 

occurrence,  source.  A  combination 
of  one  or  more  characters  in  a 
source  text.  It  may  represent; 
( 1 )  a  Word-utterance  ( 2 )  a 
punctuation  mark  or  a  set  of 
such;  (3)  a  symbol  or  a  set  of 
such;  (4)  a  diagram  or  a  set  of 
such;  etc. 

positional  preposition.  Cf.  Preposi¬ 
tion. 

PREDICTIONS, 

foresight  (FP),  information  con¬ 
cerning  TC’s  which  are  expected 
to  occur  somewhere  in  the  sen¬ 
tence  under  consideration.  Such 
information  is  derived  either 
from  rules  of  grammar  incorpo¬ 
rated  in  the  machine  instructions, 
or  from  Glossary  Predictions,  or 
from  the  Temporary  Profile. 

glossary  (GP),  partial  information, 
retrieved  from  the  Glossary  or 
Special  List  and  stored  as  part 
of  a  String,  indicating  what  kinds 
of  TC’s  are  expected  to  occur 
somewhere  before  or  after  the 
Current  SC  in  the  same  sentence. 
A  GP  is  recorded  by  assigning 
an  Urgency  Signal  to  a  Gram¬ 
mar  Essential.  One  String  may 
contain  several  GP’s. 
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PREPOSITION, 

positional.  One  of  a  set  of  Russian 
prepositions  which  govern  either 
the  accusative  or  locative  case. 

profile.  The  sequence  of  sets  of 
designations,  incorporated  in 
each  String  of  a  sentence,  which 
may  throw  light  upon  the  ranges 
of  its  clauses  and  phrases. 

Selected  (SP).  The  Temporary 
Profile,  revised  during  an  itera¬ 
tive  cycle  in  tire  syntactic  inte¬ 
gration  process. 

Skeleton  (PS).  The  initial  stage  of 
a  Profile,  which  bears  one  signal 
for  each  Occurrence,  indicating 
the  latter’s  significance  in  deter¬ 
mining  the  clauses  and  phrases 
of  the  current  sentence. 

Temporary  (TP).  A  sequence  of 
sets  of  four  preliminary  desig¬ 
nators  assigning  a  rough  clause 
(C)  and  phrase  (P)  numher,  as 
well  as  a  Status  ( e ) — and  possi¬ 
ble  Backward,  (b) — Flag,  to 
each  Occurrence. 

pseudo- root .  That  portion  ( if  any ) 
of  a  word-Occurrencc  remaining 
after  the  Pseudo-prefixes,  End¬ 
ing,  and  Pseudo- suffixes  are 
stripped  off. 

pseudo-prefix.  One  of  a  set  of  com¬ 
binations  of  source  letters  which 
are  frequently  found  at  the  be¬ 
ginning  of  words  in  the  source 
language. 

pseudo-suffix.  One  of  a  set  of  com¬ 
binations  of  source  letters  which 
arc  frequently  found  before  the 
Source  Ending  of  words  in  the 
source  language. 

satellite  of  a  Pseudo-root.  A  Glos¬ 
sary  Entry  listing  the  Transform 
of  a  Source  Stem  which  contains 
the  Pseudo-root  in  question. 

sentence  element  (SE).  One  of  the 
following  ingredients  of  a  sen¬ 
tence:  Subject,  Predicate,  Com¬ 
plement,  Modifier,  Master,  Clause 
and  Phrase. 


Appendix  II.  List  of  Symbols 

a  Extended  Address 

A  Heading  in  TCj  and  FP 

i  Accusative  case 

Aj  Adjective,  a  Ps 

am  Animate 

Av  Adverb,  a  Ps 

i  Address  of  argument  in  Table 

of  Endings 

3  Signal  in  SC  to  indicate  the 
String  where  concatenation  xvas 


SIGNAL, 

peculiarity  (8).  An  indicator  of 
some  peculiar  nature  of  an  Oc¬ 
currence,  e  g.  that 
it  is  a  capitalized  word, 
it  is  an  initial, 
its  root  is  not  listed,  etc. 

uncertainty  (X).  Used  instead  of 
a  clause  or  phrase  number  in  a 
Profile,  when  the  determination 
of  that  number  is  not  possible. 

urgency  ( u ) ,  One  of  the  numbers 
0  to  7,  indicating  the  probability 
of  a  Glossary  Prediction  (GP), 
used  to  form  Urgency  Code  U 
in  FP,  according  to  the  follow¬ 
ing  relation  between  u  and  U: 

0  —  1;  1-3;  2-5;  3  —  7. 

A  u  signal,  4  units  higher  than 
the  above,  indicates  an  alternate 
prediction  of  a  clause. 
sorting  file.  Cf.  File. 
specifications, 

entry  ( ES ) .  Signals  in  Hi  and  H» 
to  specify  the  type  of  query. 

general  ( GS ) .  Designators  in  a 
String,  consisting  of  a  Word  (in) 
and  Correspondence  ( c )  Flags, 
as  well  as  a  Signal  of  Peculiarity 
(8). 

morphological  ( MS ) ,  Designators 
in  the  TC/,  FP  and  Hi  of  a 
String,  which  deal  with  the 
grammatical  interpretation  of  the 
original  Occurrence. 
stem.  The  portion  of  a  word  remain¬ 
ing  after  the  ending  is  removed. 
siring.  The  information,  replacing 
the  original  Occurrence,  which 
is  available  to  the  routine  dur¬ 
ing  the  process  of  syntactic  inte¬ 
gration. 

New.  A  String  which  is  inserted 
during  the  process  of  syntactical 
integration. 

Old.  A  String  which  is  available 
at  the  beginning  of  the  syntactic 
process. 

table  of  Endings.  A  tabulation  of 
the  morphological  possibilities  of 
each  Source  Ending, 


established  during  backward 
examination 

b  Backward  Flag 

C  For  all  Strings  other  than  the 

first  in  a  sentence,  C  is  the 
clause  number  in  the  Profile. 
For  the  first  string  of  a  sen¬ 
tence  (since  all  sentences  start 
with  clause  number  1 ),  we  shall 
use  this  symbol  as  a  code: 


tag,  identification  ( f) .  A  serial  num¬ 
ber  attached  to  each  Source  Oc¬ 
currence  of  a  text  sentence.  It 
consists  of  ( 1 )  page  number, 
(2)  sentence  number,  and  (3) 
Occurrence  number. 
transform,  source  (a).  A  contrac¬ 
tion  representing  the  Stem  of  a 
Source  Occurrence  in  the  ex¬ 
ternal  memory. 

urgency  code  (U).  One  of  the  num¬ 
bers  0  to  7,  connoting  the  prob¬ 
ability  of  a  Foresight  Prediction 
(FP)  as  follows; 

7,  must  occur  sometime 
5,  very  likely  to  occur  sometime 
3,  may  occur  sometime.  An  FP 
bearing  this  U  is  erased  by  a 
subsequent  FP  identical  to  it 
1,  will  be  the  next  Choice  or 
won’t  occur  at  all.  It  is  erased 
after  the  next  SC. 

A  code  of  6, 4, 2,  and  0,  indi¬ 
cates  the  same  degree  of  Ur¬ 
gency  as  7,  5,  3  and  1  respec¬ 
tively.  Moreover,  the  even-num- 
bered  codes  denote  an  FP  alter¬ 
nate  to  the  last  preceding  odd- 
numbered  FP  in  the  same  String 
(c.g.  successive  U’s  of  5,  4,  2 
indicate  that  the  second  and 
third  predictions  are  alternates 
for  the  first,  so  that  if  one  of  the 
three  occurs,  all  three  could  be 
erased).  An  FP  with  U  >  4  is 
not  erased  until  the  end  of  the 
iterative  cycle,  unless  it,  or  one 
of  its  alternates  are  satisfied.  An 
FP  with  U  =  6  or  U  =  7,  left 
unsatisfied  at  the  end  of  a  cycle, 
calls  for  another  iteration. 
urgency  signal — Cf.  Signal 
utterance,  word — Cf.  Word 
word — utterance  (W).  One  word  or 
a  set  of  consecutive  words — in 
complete  or  abbreviated  form — 
used  as  an  entity  ( c.g.  an  ini¬ 
tial,  compound  word,  an  idiom, 
etc. ) . 


1,  declarative  sentence 

2,  interrogative  sentence 

3,  exclamatory  sentence, 
etc. 

This  is  possible  because  the 
Temporary  Profile  is  obtained 
as  a  result  of  an  iterative  rou¬ 
tine,  and  the  nature  of  the 
sentence  is  known  before  Part 
II  is  undertaken. 
c  Correspondence  Flag 
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Cap  Word  starts  with  capital,  in  8 

cd  Coordinate 

Cj  Conjunction,  a  Ps 

Cls  Clause,  an  SE 

co  Clause  opener,  in  PS 

cp  Copulative 

Cpl  Complement,  an  SE 

Cpr  Compound  root,  in  8 

cw  Coordinate  word,  in  PS 

.1  Source  Transform 

6  Signal  of  Peculiarity 

D  Heading  in  C/ 

d  Dative  case 

dm  Demonstrative 

E  Pidgin  Ending 

e  A  I -bit  to  indicate  that  an 

English  preposition  is  to  be 
used  as  a  PI 

ES  Entry  Specifications 

es  End  of  sentence,  in  PS 
Esp  End-of-sentence-period,  in  6 

F  Feminine  gender 

/  Chain  Flag 

FC  Final  Choice 

FP  Foresight  Prediction 

C  Gender:  M,  F,  and  N.  An  en¬ 
circled  gender  indicates  the 
tioo  other  genders 
g  Genitive  case 

GE  Grammar  Essential,  such  as  g, 

a,  d,  f,  x,  m 

GP  Glossary  Predictions 

GS  General  Specifications 

Hi  Storage  of  queries  anent  Un¬ 

expected  and  Doubtful  Choices 
Hs  Storage  of  alternate  predicted 

TC 

Hj  Storage  of  surplus  unpredicted 

TC 

h  (for  convenience  of  the  reader 

only)  Signal  for  type  of  resolu¬ 
tion  in  Hindsight:  minus  for 
partial,  plus  for  complete 

/  Instrumental  case 

id  Indicative  mood 

Idm  Idiom,  in  6 

if  Infinitive  mood 

im  Inanimate 

Ini  Initial  (a  capital  letter  fol¬ 

lowed  by  a  period),  in  8 

J  A  number  indicating  the  }"* 

TC  in  String  Q 

j  A  serial  number  of  TC  within 

a  String  q 


K  A  number  indicating  the  k'h 

FP  in  String  Q 

k  A  serial  number  of  an  FP 

within  a  String  q 

L  Chain  number,  indicating  the 

degree  to  which  the  syntactic 
integration  lacks  cohesion 
l  locative  case 

M  Masculine  gender 
m  A  Flag,  indicating  whether  or 
not  a  verb  in  the  infinitive  is 
predicted 

Mfr  Modifier,  an  SE 
Mst  Master,  an  SE 

N  Neuter  gender 
n  Nominative  case 
Nn  Noun,  a  PS 
Np  Not  part  of  speech 
Nsr  Non-Slavic  root,  in  8 

<t>  (for  convenience  of  the  reader 
only)  A  1-bit  to  show  that  the 
W  was  found  in  the  Special 
List 

P  Phrase  number  in  a  Profile 
p  Plural  number 
pc  Postpositive  copulative  implied, 
in  PS 

Pdc  Predicate,  an  SE 

PI  Pretarget  Insert 

Pn  Pronoun,  a  PS 

pn  Personal 

po  Phrase  opener,  in  PS 

Pp  Preposition,  a  Ps. 

Pr  present  tense 

PS  Profile  Skeleton.  A  set  of  sig¬ 

nals,  in  addition  to  punctuation 
marks,  such  as 
co  clause  opener 
cw  coordinate  word 
es  end  of  sentence 
po  phrase  opener 
pc  postpositive  copulative 
implied 
vf  verb  finite 
vi  verb  infinitive 
Ps  Part  of  speech 

ps  Positive  degree 

pv  Passive  voice 

pw  Proper  word 

Q  Running  coordinate  for  q 

q  The  serial  number  of  the 

Strings 


R  Target  order  number,  indicat¬ 

ing  sequence  in  which  the 
English  correspondents  will  be 
printed  out 

r  (for  convenience  of  the  reader 

only),  indicator  of  the  String 
in  the  consideration  of  which 
a  given  FP  is  deleted.  An  en¬ 
circled  (?)  indicates  the  String 
where  the  FP  is  revised, 
rl  Relative 

S<  Storage  space  in  internal  mem¬ 
ory  allotted  to  information 
about  Occurrence  t 
s  Singular  number 

Sbj  Subject,  an  SE 

SC  Selected  Choice 

SE  Sentence  Element,  in  an  FP 
Sis  String  inserted  by  syntax,  in  8 
SP  Selected  Profile 

Spf  Special  possessive  form,  in  8 

T  Target  correspondent 

t  Identification  Tag  of  Occur¬ 

rence 

TC  Temporary  Choice 

TI  Target  Insert 

tl  Title 

TP  Temporary  Profile 

ts  Tense 

U  Urgency  Code,  in  FP 

u  Urgency  Signal,  in  TC 

V  Heading  in  FP 

l)  Status  Flag 

Vb  Verb,  a  Ps 

vi  Verb  infinitive,  in  PS 

vf  Verb  finite,  in  PS  and  under 
heading  V  in  FP 

W  Word- utterance 

w  Word  Flag 

X  Signal  of  Uncertainty,  in  PS 
x  A  1-bit  to  indicate  that  the  lo¬ 
cative  case  is  governed  either 
at  once,  if  the  TC  represents  a 
preposition,  or  after  the  next 
Positional  Preposition  (which 

cf.  in  List  of  Terms). 

Y  A  heading  in  TC  and  FP 

y  Count  of  Pseudo-prefixes  in  a 
word-Occurrence 

Z  A  heading  indicating  grammar 

plurality 

z  Count  of  Pseudo-suffixes  in  a 

word-Occurrence . 
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